Point Information Gain and Multidimensional Data Analysis

نویسندگان

  • Renata Rychtáriková
  • Jan Korbel
  • Petr Machácek
  • Petr Císar
  • Jan Urban
  • Dalibor Stys
چکیده

We generalize the point information gain (PIG) and derived quantities, i.e., point information gain entropy (PIE) and point information gain entropy density (PIED), for the case of the Rényi entropy and simulate the behavior of PIG for typical distributions. We also use these methods for the analysis of multidimensional datasets. We demonstrate the main properties of PIE/PIED spectra for the real data with the examples of several images and discuss further possible utilizations in other fields of data processing.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Point Divergence Gain and Multidimensional Data Sequences Analysis

We introduce novel information-entropic variables—a Point Divergence Gain (Ω α ), a Point Divergence Gain Entropy (Iα), and a Point Divergence Gain Entropy Density (Pα)—which are derived from the Rényi entropy and describe spatio-temporal changes between two consecutive discrete multidimensional distributions. The behavior of Ω α is simulated for typical distributions and, together with Iα and ...

متن کامل

PARSIMONY: An Infrastructure for Parallel Multidimensional Analysis and Data Mining

Multidimensional analysis and online analytical processing (OLAP) operations require summary information on multidimensional data sets. Most common are aggregate operations along one or more dimensions of numerical data values. Simultaneous calculation of multidimensional aggregates are provided by the Data Cube operator, used to calculate and store summary information on a number of dimensions...

متن کامل

Modified Approach for Classifying Multi-Dimensional Data-Cube Through Association Rule Mining for Granting Loans in Bank

In this paper, modified Approach for classifying Multi-dimensional data cube is constructed. It explores data cubes in large Multi-Dimensional Schema. Numerical and Nominal attributes are categorized based on Principal Component Analysis. Semantic relationships are identified by applying Multidimensional scaling. Additionally, AR is integrated for finding the inserting measures. Many algorithms...

متن کامل

ERPCA: A Novel Approach for Risk Evaluation of Multidimensional Risk Prediction Clustering Algorithm

Clustering is a data mining technique used to place data elements into related groups without advance knowledge of the group definitions. In this paper clustering is employed to support efficient decision making by clustering mass storage data available in banks. Risk assessment is important task of Banks, as the failure and success of the Bank depends largely on banks’ ability to evaluate the ...

متن کامل

Measurement of Multivariate Scaling and Factorization in Exclusive Multiparticle Production

Several new model independent techniques for the analysis of multidimensional data are presented and applied to exclusive multiparticle production. For the reaction pp-+pprr'n+~c-~from 12 to 28 GeV/c, an algorithm that directly compares two multidimensional point distributions is used to show that the shape of d120/d($?)12 is energy independent while d60/dx6 (x=p,,/pmax) varies dramatically wit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Entropy

دوره 18  شماره 

صفحات  -

تاریخ انتشار 2016